Function
Current Version: 1.0
Converts post text from Tumblr blog export’s HTML directory into a
.csv file with filename, text, and date for text analysis.
Issues
Major
- Nested reblogs aren’t visually specified
- CSV fields can’t be user specified
Minor
Patch
- Not all HTML characters are being formatted properly
(e.g.
text–dash:)
- Frog image and confirmation button on splash screen aren’t
centered
Started April 8, 2026
README
About
A basic parser for Tumblr’s html blog export.
Outputs in CSV, with columns for filename, post text, GMT date, and
tags.
Usage
- First, export
your Tumblr blog
- Once you have downloaded your export, extract it
- Download and run the FrogLab
Tumblr Parser
- When prompted, select the “html” folder containing your posts
- Select output destination
Limitations in v1.0
- Posts are exported in .csv format, with newlines to separate posts
and tags. Certain versions of Excel might interpret these newlines as
delimiters. Take care when importing data into a spreadsheet program to
avoid errors.
- Reblog chains aren’t visually specified
- Not all HTML characters are formatted correctly
(e.g.
text–dash:)
- CSV columns can’t be user specified